Using accent information in ASR models for Swedish
نویسنده
چکیده
A common technique to cope with the large variability in the acoustic realisations of the phonetic classes in speech, is to partition the data according to a linguistically significant variable. In this work, accent dependent phonetic models were trained and used both as an analysis tool for pronunciation variation and in the attempt to improve ASR performance. The Idea Accent dependent training The database is partitioned into accent areas. Accent dependent phonetic models are trained independently. speech database phone1 phoneN accentj accent dependent training accent dependent model set Accent analysis ⋆ The model parameters obtained this way, represent the statistical variation of the acoustic features across accent areas. This information can be used for pronunciation variation analysis. phonei clustering visual analysis
منابع مشابه
Accent Clustering in Swedish Using the Bhattacharyya Distance
In an attempt to improve automatic speech recognition (ASR) models for Swedish, accent variations were considered. These have proved to be important variables in the statistical distribution of the acoustic features usually employed in ASR. The analysis of feature variability have revealed phenomena that are consistent with what is known from phonetic investigations, suggesting that a consisten...
متن کاملImproving ASR performance on non-native speech using multilingual and crosslingual information
This paper presents our latest investigation of automatic speech recognition (ASR) on non-native speech. We first report on a non-native speech corpus an extension of the GlobalPhone database which contains English with Bulgarian, Chinese, German and Indian accent and German with Chinese accent. In this case, English is the spoken language (L2) and Bulgarian, Chinese, German and Indian are the ...
متن کاملPhonological features in the bilingual lexicon: Insights from tonal accent in Swedish
Scandinavian languages like Swedish employ tonal accent as a lexical phonological feature, where suprasegmental information can be the sole factor differentiating between words. Using cross-modal semantic fragment priming we tested the following: (a) Do monolingual speakers of Swedish use tonal accent information during lexical access? (b) Do bilingual speakers, who grew up with one tonal (Swed...
متن کاملModelling Accents for Automatic Speech Recognition
Accent is cited as an issue for speech recognition systems. If they are to be widely deployed, Automatic Speech Recognition (ASR) systems must deliver consistently high performance across user populations. Hence the development of accentrobust ASR is of significant importance. This research investigates techniques for compensating for the effects of accents on performance of Hidden Markov Model...
متن کاملAcoustic model selection for recognition of regional accented speech
Accent is cited as an issue for speech recognition systems [1]. Research has shown that accent mismatch between the training and the test data will result in significant accuracy reduction in Automatic Speech Recognition (ASR) systems. Using HMM based ASR trained on a standard English accent, our study shows that the error rates can be up to seven times higher for accented speech, than for stan...
متن کامل